Picture for Xiaogeng Liu

Xiaogeng Liu

SafeGen-Bench: Benchmarking Safety in Image-Conditioned Text-to-Video Generation

Add code
May 31, 2026
Viaarxiv icon

When Are Teacher Tokens Reliable? Position-Weighted On-Policy Self-Distillation for Reasoning

Add code
May 20, 2026
Viaarxiv icon

DecodingTrust-Agent Platform (DTap): A Controllable and Interactive Red-Teaming Platform for AI Agents

Add code
May 06, 2026
Viaarxiv icon

Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution

Add code
Mar 25, 2026
Viaarxiv icon

ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention

Add code
Mar 23, 2026
Viaarxiv icon

ReasoningBomb: A Stealthy Denial-of-Service Attack by Inducing Pathologically Long Reasoning in Large Reasoning Models

Add code
Jan 29, 2026
Viaarxiv icon

MetaAgent: Automatically Constructing Multi-Agent Systems Based on Finite State Machines

Add code
Jul 30, 2025
Viaarxiv icon

DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agents

Add code
Jun 13, 2025
Viaarxiv icon

OET: Optimization-based prompt injection Evaluation Toolkit

Add code
May 01, 2025
Viaarxiv icon

Doxing via the Lens: Revealing Privacy Leakage in Image Geolocation for Agentic Multi-Modal Large Reasoning Model

Add code
Apr 29, 2025
Viaarxiv icon